Encoding information on adjectives in a lexical-semantic net for computational applications

Authors

  • Antonietta Alonge
  • Francesca Bertagna
  • Nicoletta Calzolari
  • Adriana Roventini
  • Antonio Zampolli
Abstract

The goal of this paper is to describe how the EuroWordNet framework for representing lexical meaning is being modified within an Italian National Project in order to include information on adjectives. The focus is on the 'new' semantic relations being encoded and on the revisions we have made to the EuroWordNet Top Ontology structure. We also briefly discuss the utility of the information which is being encoded for computational applications.

Introduction

The Princeton WordNet (henceforth WN) is a lexical semantic network in which the meanings of words are represented in terms of their conceptual and lexical relations to other words. The basic notion around which it is developed is that of a synset (synonym set), i.e. a set of words with the same Part-of-Speech (PoS) that can be interchanged in a certain context. Various conceptual and lexical relations are then encoded between synsets of the same PoS: e.g., hyponymy, antonymy, meronymy, etc. (Miller et al. 1990; Fellbaum 1998b). Within the EuroWordNet (henceforth EWN) project [1], a similar (multilingual) lexical resource was developed, retaining the basic underlying design of WN but enriching the set of lexical-semantic relations to be encoded for nouns and verbs in various ways [2], in order to obtain a maximally re-usable resource for computational applications. Thus, a) cross-PoS (xPos) relations were added so that different surface realizations of similar concepts within and across languages could be matched (e.g., the noun research and the verb to research could be linked as ...

[1] EWN was a project in the EC Language Engineering (LE-4003 and LE-8328) programme. In a first phase, the partners involved were the University of Amsterdam (coordinator); the Istituto di Linguistica Computazionale, CNR, Pisa; the Fundacion Universidad Empresa (a cooperation of UNED, Madrid, Politecnica de Catalunya, Barcelona, and the University of Barcelona); the University of Sheffield; and Novell Linguistic Development (Antwerp), changed to Lernout & Hauspie during the project. In a further phase, the database was extended with German, French, Estonian and Czech. Complete information on EWN can be found at its web site: http://www.hum.uva.nl/~ewn/.

[2] Adjectives and adverbs were encoded in EWN only as targets of relations from nouns and verbs.
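The synset-and-relation design described in the introduction can be pictured with a small data structure. The following is only a minimal sketch under stated assumptions, not the WN or EWN implementation: the class names (`Synset`, `LexicalNet`), the synset identifiers, and the relation labels are all hypothetical, and `xpos_placeholder` in particular is a stand-in rather than the actual EWN relation name, which the text above does not give.

```python
from dataclasses import dataclass, field


@dataclass
class Synset:
    """A set of same-PoS words that can be interchanged in some context."""
    synset_id: str
    pos: str                       # part of speech, e.g. "n" or "v"
    words: frozenset
    # relation label -> list of target synset ids
    relations: dict = field(default_factory=dict)


class LexicalNet:
    """Toy lexical-semantic net (hypothetical, for illustration only)."""

    def __init__(self):
        self.synsets = {}

    def add_synset(self, synset_id, pos, words):
        self.synsets[synset_id] = Synset(synset_id, pos, frozenset(words))

    def relate(self, source_id, relation, target_id, cross_pos=False):
        source = self.synsets[source_id]
        target = self.synsets[target_id]
        # Same-PoS relations are the default; cross-PoS links must be
        # requested explicitly, mirroring the xPos extension described above.
        if not cross_pos and source.pos != target.pos:
            raise ValueError("relation across PoS requires cross_pos=True")
        source.relations.setdefault(relation, []).append(target_id)

    def targets(self, source_id, relation):
        return list(self.synsets[source_id].relations.get(relation, []))


net = LexicalNet()
net.add_synset("dog.n", "n", ["dog"])
net.add_synset("animal.n", "n", ["animal"])
net.add_synset("research.n", "n", ["research"])
net.add_synset("research.v", "v", ["research"])

# Same-PoS relation (label is illustrative).
net.relate("dog.n", "hyponym_of", "animal.n")
# Cross-PoS link between the noun and the verb; the label is a placeholder.
net.relate("research.n", "xpos_placeholder", "research.v", cross_pos=True)
```

Keeping relations as labelled links between synset identifiers, rather than between individual words, reflects the point made above that relations hold between synsets; the explicit `cross_pos` flag is just one possible way to distinguish the added xPos links from same-PoS ones.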


Similar resources

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of the English language in which nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...


Developing a Semantic Similarity Judgment Test for Persian Action Verbs and Non-action Nouns in Patients With Brain Injury and Determining its Content Validity

Objective: Evidence from brain trauma suggests that the two grammatical categories of noun and verb are processed in different regions of the brain due to differences in the complexity of grammatical and semantic information processing. Studies have shown that verbs belonging to different semantic categories lead to neural activity in different areas of the brain, and action verb processing is r...


Learning Subjective Adjectives from Corpora

Subjectivity tagging is distinguishing sentences used to present opinions and evaluations from sentences used to objectively present factual information. There are numerous applications for which subjectivity tagging is relevant, including information extraction and information retrieval. This paper identifies strong clues of subjectivity using the results of a method for clustering words accor...


Extension and Use of GermaNet, a Lexical-Semantic Database

This paper describes GermaNet, a lexical-semantic network and on-line thesaurus for the German language, and outlines its future extension and use. GermaNet is structured along the same lines as the Princeton WordNet (Miller et al., 1990; Fellbaum, 1998), encoding the major semantic relations like synonymy, hyponymy, meronymy, etc. that hold among lexical items. Constructing semantic networks l...


Senses of Polysemous Nouns: Building a Computational Lexicon of Basic Japanese Nouns

We have constructed the IPA Lexicon of Basic Japanese Nouns (IPAL-BN), which has a hierarchical structure based on the syntactic and semantic properties of nouns. In our lexicon, each lexical entry consists of subentries, and subentries have semantic property information. Among these elements, we focus here on the subentry description. Conventional Japanese dictionaries only enumerate various u...




Publication date: 2000